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DETAILED ACTION 



1. 



This action is responsive to communications: amendment filed on 9/27/2004. 



2. 



Claims 1-17 are pending in the case. Claims 1, 6, and 1 1 are independent claims. 



Claim Rejections - 35 USC §103 



3. The following is a quotation of 35 U.S.C. 103(a) which forms the basis for all 
obviousness rejections set forth in this Office action: 

(a) A patent may not be obtained though the invention is not identically disclosed or described as set forth in 
section 102 of this title, if the differences between the subject matter sought to be patented and the prior art are 
such that the subject matter as a whole would have been obvious at the time the invention was made to a person 
having ordinary skill in the art to which said subject matter pertains. Patentability shall not be negatived by the 
manner in which the invention was made. 

4. Claims 1, 6 and 11 are rejected under 35 U.S.C. 103(a) as being unpatentable over 
Kostoff et al. (hereafter referred to as Kostoff), US 5,440,481 patented 08/08/1995. 

Regarding independent claim 1, Kostoff teaches determining a frequency of each word 
in each document in fig. 2, table 1, col. 4 lines 50-68, and col. 6 line 65 - col. 7 line 11. Kostoff 
teaches creating a table of most frequently occurring words in the documents in fig. 2, table 1, 
col. 4 lines 50-68, and col. 6 line 65 - col. 7 line 1 1 . Kostoff teaches determining a frequency of 
phrases in each document that could contain only words in a table in fig. 2, table 1, col. 4 lines 
50-68, and col. 6 line 65 - col. 7 line 11. Kostoff teaches outputting the most frequently 
occurring words and most frequently occurring phrases as a dictionary in fig. 2 and col. 4 lines 



Kostoff does not specifically teach inputting a maximum dictionary size and limiting the 
dictionary to the inputted maximum dictionary size, such that the dictionary contains less than all 
words in the documents. However, Kostoff does acknowledge the importance and limitation of 



64-68. 
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memory size for storing a list of trivial words in col. 4 lines 44-45. This list is a precursor to the 
dictionary, however it teaches one of ordinary skill in the art at the time of the invention the 
relevance of memory storage size. Kostoff also teaches selecting a portion of the word and 
phrase dictionary in col. 5 line 59 - col. 6 line 64. Kostoff uses an example of selecting the 60 
most often repeated phrases. Kostoff notes that more or less than 60 most often repeated phrases 
may be selected at the discretion of the user. 

In light of these teachings of Kostoff, one of ordinary skill in the art at the time of the 
invention would have truncated the dictionary of Kostoff at the user inputted number of most 
often repeated phrases in the event the dictionary had to reside within a limited memory storage. 
The teaching of Kostoff of possible memory storage constraints having an impact on a list size in 
col. 4 lines 44-45 would have motivated and taught insight to the person of ordinary skill in the 
art at the time of the invention to have made this modification. It would have been obvious to 
one of ordinary skill in the art at the time of the invention to have discarded the less frequent 
terms below the population threshold inputted by the user because they would not have been of 
further use in determining the themes of the text to prepare it for clustering with other 
documents. Eliminating the unused terms would have desirably saved memory as seen in col. 4 
lines 44-45. Only the top set of words and phrases determined by the user would have been used 
and therefore it would have been obvious to have only retained those words and phrases in the 
dictionary. 

Regarding independent claim 6, Kostoff teaches determining a frequency of each word 
in each document in fig. 2, table 1, col. 4 lines 50-68, and col. 6 line 65 - col. 7 line 11. Kostoff 
teaches creating a table of most frequently occurring words in the documents in fig. 2, table 1, 
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col. 4 lines 50-68, and col. 6 line 65 - col. 7 line 11. Kostoff teaches determining a frequency of 
phrases in each document that could contain only words in a table in fig. 2, table 1, col. 4 lines 
50-68, and col. 6 line 65 - col. 7 line 11. Kostoff teaches outputting the most frequently 
occurring words and most frequently occurring phrases as a dictionary in fig. 2 and col. 4 lines 
64-68. 

Kostoff does not specifically teach inputting a maximum dictionary size and limiting the 
dictionary to the inputted maximum dictionary size, such that the dictionary contains less than all 
words in the documents. However, Kostoff does acknowledge the importance and limitation of 
memory size for storing a list of trivial words in col. 4 lines 44-45. This list is a precursor to the 
dictionary, however it teaches one of ordinary skill in the art at the time of the invention the 
relevance of memory storage size. Kostoff also teaches selecting a portion of the word and 
phrase dictionary in col. 5 line 59 - col. 6 line 64. Kostoff uses an example of selecting the 60 
most often repeated phrases. Kostoff notes that more or less than 60 most often repeated phrases 
may be selected at the discretion of the user. 

In light of these teachings of Kostoff, one of ordinary skill in the art at the time of the 
invention would have truncated the dictionary of Kostoff at the user inputted number of most 
often repeated phrases in the event the dictionary had to reside within a limited memory storage. 
The teaching of Kostoff of possible memory storage constraints having an impact on a list size in 
col. 4 lines 44-45 would have motivated and taught insight to the person of ordinary skill in the 
art at the time of the invention to have made this modification. It would have been obvious to 
one of ordinary skill in the art at the time of the invention to have discarded the less frequent 
terms below the population threshold inputted by the user because they would not have been of 
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further use in determining the themes of the text to prepare it for clustering with other 
documents. Eliminating the unused terms would have desirably saved memory as seen in col. 4 
lines 44-45. Only the top set of words and phrases determined by the user would have been used 
and therefore it would have been obvious to have only retained those words and phrases in the 
dictionary. 

Kostoff does not explicitly teach the creation of the word and phrases lists in two separate 
passes through the document. One of ordinary skill in the art at the time of the invention would 
have known how to create the two lists in separate passes through the document. It would have 
been obvious to one of ordinary skill in the art at the time the invention was made to use their 
skill in the art to have created each list as a result of each of two passes through the document. 
This would have been obvious and necessary in order to create the second list since the phrase 
selection would have been dependent on the contents of the first list. 

Regarding independent claim 11, Kostoff teaches determining a frequency of each 
word in each document in fig. 2, table 1, col. 4 lines 50-68, and col. 6 line 65 - col. 7 line 11. 
Kostoff teaches creating a table of most frequently occurring words in the documents in fig. 2, 
table 1, col. 4 lines 50-68, and col. 6 line 65 - col. 7 line 1 1 . Kostoff teaches determining a 
frequency of phrases in each document that could contain only words in a table in fig. 2, table 1, 
col. 4 lines 50-68, and col. 6 line 65 - col. 7 line 11. Kostoff teaches outputting the most 
frequently occurring words and most frequently occurring phrases as a dictionary in fig. 2 and 
col. 4 lines 64-68. 

Kostoff does not specifically teach inputting a maximum dictionary size and limiting the 
dictionary to the inputted maximum dictionary size, such that the dictionary contains less than all 
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words in the documents. However, Kostoff does acknowledge the importance and limitation of 
memory size for storing a list of trivial words in col. 4 lines 44-45. This list is a precursor to the 
dictionary, however it teaches one of ordinary skill in the art at the time of the invention the 
relevance of memory storage size. Kostoff also teaches selecting a portion of the word and 
phrase dictionary in col. 5 line 59 - col. 6 line 64. Kostoff uses an example of selecting the 60 
most often repeated phrases. Kostoff notes that more or less than 60 most often repeated phrases 
may be selected at the discretion of the user. 

In light of these teachings of Kostoff, one of ordinary skill in the art at the time of the 
invention would have truncated the dictionary of Kostoff at the user inputted number of most 
often repeated phrases in the event the dictionary had to reside within a limited memory storage. 
The teaching of Kostoff of possible memory storage constraints having an impact on a list size in 
col. 4 lines 44-45 would have motivated and taught insight to the person of ordinary skill in the 
art at the time of the invention to have made this modification. It would have been obvious to 
one of ordinary skill in the art at the time of the invention to have discarded the less frequent 
terms below the population threshold inputted by the user because they would not have been of 
further use in determining the themes of the text to prepare it for clustering with other 
documents. Eliminating the unused terms would have desirably saved memory as seen in col. 4 
lines 44-45. Only the top set of words and phrases determined by the user would have been used 
and therefore it would have been obvious to have only retained those words and phrases in the 
dictionary. 



Application/Control Number: 09/629,83 1 Page 7 

Art Unit: 2176 

5, Claims 2-5, 7-10, and 12-17 are rejected under 35 U.S.C. 103(a) as being 
unpatentable over Kostoff et al. (hereafter referred to as Kostoff), US 5,440,481 patented 
08/08/1995 as applied to claims 1, 6, and 11 above, and further in view of Kirsch et al. 
(hereafter referred to as Kirsch), US 6,070,158 filed 11/13/1997, Kobayashi, US 5,742,834 
patented 04/21/1998 and Turney, US 6,470,307 Bl filed 06/23/1997. 

Regarding dependent claim 2, Kostoff teaches adding words to a dictionary table in fig. 
2, table 1, col. 4 lines 50-68, and col. 6 line 65 - col. 7 line 11. Kostoff teaches determining the 
frequency of each word remaining n the table in fig. 2, table 1, col. 4 lines 50-68, and col. 6 line 
65 - col. 7 line 1 1 . Kostoff teaches removing words below a frequency level from the dictionary 
table in col. 6 lines 2-64. 

Kostoff does not teach removing punctuation and case from the documents. Kostoff does 
not teach removing stop words from the document. Kostoff does not teach replacing words in 
the documents with synonyms. Kostoff does not teach removing duplicate words from the 
documents. Kirsch teaches removing punctuation and case from the documents in col. 12 lines 
5-7. Kirsch teaches removing stop words from the document in col. 12 lines 13-15. Kobayashi 
teaches replacing words in the documents with synonyms in fig. 3, 34-35, and col. 1 line 54 - 
col. 2 line 13. Turney teaches removing duplicate words from the documents in col. 5 lines 37- 
38. 

It would have been obvious to one of ordinary skill in the art at the time the invention 
was made to have combined Kirsch, Kobayashi, and Turney into Kostoff to have created the 
claimed invention. It would have been obvious and desirable to have combined the punctuation 
and stop word removal technique of Kirsch into Kostoff so that the documents passes would 
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have been more efficient. It would have been obvious and desirable to have combined the 
synonym word replacement of Kobayashi into Kostoff so that the word counts could have been 
uniform across all of the documents, which would have yielded the most accurate clustering 
results. It would have been obvious and desirable to have combined the duplicate word removal 
of Turney into Kostoff so that the lists would have been uniform among all the documents in the 
cluster. This would have yielded the most accurate clustering results among the documents. 

Regarding dependent claim 3, Kostoff teaches inputting one or more stop words, 
synonyms and a frequency level in col. 4 lines 39-49, col. 5 lines 59-64, and col. 6 lines 60-64. 

Regarding dependent claim 4, Kostoff teaches adding words to a table in fig. 2, table 1, 
col. 4 lines 50-68, and col. 6 line 65 - col. 7 line 11. Kostoff teaches determining the frequency 
of each word remaining n the table in fig. 2, table 1, col. 4 lines 50-68, and col. 6 line 65 - col. 7 
line 11. Kostoff teaches removing words below a frequency level from the table in col. 6 lines 2- 
64. 

Kostoff does not teach removing punctuation and case from the documents. Kostoff does 
not teach removing stop words from the document. Kostoff does not teach replacing words in 
the documents with synonyms. Kostoff does not teach removing duplicate words from the 
documents. Kirsch teaches removing punctuation and case from the documents in col. 12 lines 
5-7. Kirsch teaches removing stop words from the document in col. 12 lines 13-15. Kobayashi 
teaches replacing words in the documents with synonyms in fig. 3, 34-35, and col. 1 line 54 - 
col. 2 line 13. Turney teaches removing duplicate words from the documents in col. 5 lines 37- 
38. 
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It would have been obvious to one of ordinary skill in the art at the time the invention 
was made to have combined Kirsch, Kobayashi, and Turney into Kostoff to have created the 
claimed invention. It would have been obvious and desirable to have combined the punctuation 
and stop word removal technique of Kirsch into Kostoff so that the documents passes would 
have been more efficient. It would have been obvious and desirable to have combined the 
synonym word replacement of Kobayashi into Kostoff so that the word counts could have been 
uniform across all of the documents, which would have yielded the most accurate clustering 
results. It would have been obvious and desirable to have combined the duplicate word removal 
of Turney into Kostoff so that the lists would have been uniform among all the documents in the 
cluster. This would have yielded the most accurate clustering results among the documents. 

Regarding dependent claim 5, Kostoff teaches inputting one or more stop words, 
synonyms and a frequency level in col. 4 lines 39-49, col. 5 lines 59-64, and col. 6 lines 60-64. 

Regarding dependent claim 7, Kostoff teaches adding words to a dictionary table in fig. 
2, table 1, col. 4 lines 50-68, and col. 6 line 65 - col. 7 line 11. Kostoff teaches determining the 
frequency of each word remaining n the table in fig. 2, table 1, col. 4 lines 50-68, and col. 6 line 
65 - col. 7 line 11. Kostoff teaches removing words below a frequency level from the dictionary 
table in col. 6 lines 2-64. 

Kostoff does not teach removing punctuation and case from the documents. Kostoff does 
not teach removing stop words from the document. Kostoff does not teach replacing words in 
the documents with synonyms. Kostoff does not teach removing duplicate words from the 
documents. Kirsch teaches removing punctuation and case from the documents in col. 12 lines 
5-7. Kirsch teaches removing stop words from the document in col. 12 lines 13-15. Kobayashi 
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teaches replacing words in the documents with synonyms in fig. 3, 34-35, and col. 1 line 54 - 
col. 2 line 13. Turney teaches removing duplicate words from the documents in col. 5 lines 37- 
38. 

It would have been obvious to one of ordinary skill in the art at the time the invention 
was made to have combined Kirsch, Kobayashi, and Turney into Kostoff to have created the 
claimed invention. It would have been obvious and desirable to have combined the punctuation 
and stop word removal technique of Kirsch into Kostoff so that the documents passes would 
have been more efficient. It would have been obvious and desirable to have combined the 
synonym word replacement of Kobayashi into Kostoff so that the word counts could have been 
uniform across all of the documents, which would have yielded the most accurate clustering 
results. It would have been obvious and desirable to have combined the duplicate word removal 
of Turney into Kostoff so that the lists would have been uniform among all the documents in the 
cluster. This would have yielded the most accurate clustering results among the documents. 

Regarding dependent claim 8, Kostoff teaches inputting one or more stop words, 
synonyms and a frequency level in col. 4 lines 39-49, col. 5 lines 59-64, and col. 6 lines 60-64. 

Regarding dependent claim 9, Kostoff teaches adding words to a table in fig. 2, table 1, 
col. 4 lines 50-68, and col. 6 line 65 - col. 7 line 1 1 . Kostoff teaches determining the frequency 
of each word remaining n the table in fig. 2, table 1, col. 4 lines 50-68, and col. 6 line 65 - col. 7 
line 1 1 . Kostoff teaches removing words below a frequency level from the table in col. 6 lines 2- 
64. 

Kostoff does not teach removing punctuation and case from the documents. Kostoff does 
not teach removing stop words from the document. Kostoff does not teach replacing words in 
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the documents with synonyms. Kostoff does not teach removing duplicate words from the 
documents. Kirsch teaches removing punctuation and case from the documents in col. 12 lines 
5-7. Kirsch teaches removing stop words from the document in col. 12 lines 13-15. Kobayashi 
teaches replacing words in the documents with synonyms in fig. 3, 34-35, and col. 1 line 54 - 
col. 2 line 13. Turney teaches removing duplicate words from the documents in col. 5 lines 37- 
38. 

It would have been obvious to one of ordinary skill in the art at the time the invention 
was made to have combined Kirsch, Kobayashi, and Turney into Kostoff to have created the 
claimed invention. It would have been obvious and desirable to have combined the punctuation 
and stop word removal technique of Kirsch into Kostoff so that the documents passes would 
have been more efficient. It would have been obvious and desirable to have combined the 
synonym word replacement of Kobayashi into Kostoff so that the word counts could have been 
uniform across all of the documents, which would have yielded the most accurate clustering 
results. It would have been obvious and desirable to have combined the duplicate word removal 
of Turney into Kostoff so that the lists would have been uniform among all the documents in the 
cluster. This would have yielded the most accurate clustering results among the documents. 

Regarding dependent claim 10, Kostoff teaches inputting one or more stop words, 
synonyms and a frequency level in col. 4 lines 39-49, col. 5 lines 59-64, and col. 6 lines 60-64. 

Regarding dependent claim 12, Kostoff teaches adding words to a dictionary table in 
fig. 2, table 1, col. 4 lines 50-68, and col. 6 line 65 - col. 7 line 11. Kostoff teaches determining 
the frequency of each word remaining n the table in fig. 2, table 1, col. 4 lines 50-68, and col. 6 
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line 65 - col. 7 line 11. Kostoff teaches removing words below a frequency level from the 
dictionary table in col. 6 lines 2-64. 

Kostoff does not teach removing punctuation and case from the documents. Kostoff does 
not teach removing stop words from the document. Kostoff does not teach replacing words in 
the documents with synonyms. Kostoff does not teach removing duplicate words from the 
documents. Kirsch teaches removing punctuation and case from the documents in col. 12 lines 
5-7. Kirsch teaches removing stop words from the document in col. 12 lines 13-15. Kobayashi 
teaches replacing words in the documents with synonyms in fig. 3, 34-35, and col. 1 line 54 - 
col. 2 line 13. Turney teaches removing duplicate words from the documents in col. 5 lines 37- 
38. 

It would have been obvious to one of ordinary skill in the art at the time the invention 
was made to have combined Kirsch, Kobayashi, and Turney into Kostoff to have created the 
claimed invention. It would have been obvious and desirable to have combined the punctuation 
and stop word removal technique of Kirsch into Kostoff so that the documents passes would 
have been more efficient. It would have been obvious and desirable to have combined the 
synonym word replacement of Kobayashi into Kostoff so that the word counts could have been 
uniform across all of the documents, which would have yielded the most accurate clustering 
results. It would have been obvious and desirable to have combined the duplicate word removal 
of Turney into Kostoff so that the lists would have been uniform among all the documents in the 
cluster. This would have yielded the most accurate clustering results among the documents. 

Regarding dependent claim 13, Kostoff teaches inputting one or more stop words, 
synonyms and a frequency level in col. 4 lines 39-49, col. 5 lines 59-64, and col. 6 lines 60-64. 
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Regarding dependent claim 14, Kostoff teaches adding words to a table in fig. 2, table 
1, col. 4 lines 50-68, and col. 6 line 65 - col. 7 line 11. Kostoff teaches determining the 
frequency of each word remaining n the table in fig. 2, table 1, col. 4 lines 50-68, and col. 6 line 
65 - col. 7 line 11. Kostoff teaches removing words below a frequency level from the table in 
col. 6 lines 2-64. 

Kostoff does not teach removing punctuation and case from the documents. Kostoff does 
not teach removing stop words from the document. Kostoff does not teach replacing words in 
the documents with synonyms. Kostoff does not teach removing duplicate words from the 
documents. Kirsch teaches removing punctuation and case from the documents in col. 12 lines 
5-7. Kirsch teaches removing stop words from the document in col. 12 lines 13-15. Kobayashi 
teaches replacing words in the documents with synonyms in fig. 3, 34-35, and col. 1 line 54 - 
col. 2 line 13. Turney teaches removing duplicate words from the documents in col. 5 lines 37- 
38. 

It would have been obvious to one of ordinary skill in the art at the time the invention 
was made to have combined Kirsch, Kobayashi, and Turney into Kostoff to have created the 
claimed invention. It would have been obvious and desirable to have combined the punctuation 
and stop word removal technique of Kirsch into Kostoff so that the documents passes would 
have been more efficient. It would have been obvious and desirable to have combined the 
synonym word replacement of Kobayashi into Kostoff so that the word counts could have been 
uniform across all of the documents, which would have yielded the most accurate clustering 
results. It would have been obvious and desirable to have combined the duplicate word removal 
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of Turney into Kostoff so that the lists would have been uniform among all the documents in the 
cluster. This would have yielded the most accurate clustering results among the documents. 

Regarding dependent claim 15, Kostoff teaches inputting stop words in col. 4 lines 39- 
49, col. 5 lines 59-64, and col. 6 lines 60-64. 

Regarding dependent claim 16, Kostoff teaches inputting synonyms in col. 4 lines 39- 
49, col. 5 lines 59-64, and col. 6 lines 60-64. 

Regarding dependent claim 17, Kostoff teaches inputting a frequency level in col. 4 
lines 39-49, col. 5 lines 59-64, and col. 6 lines 60-64. 

Response to Arguments 

6. Applicant's arguments filed 9/27/2004 have been fully considered but they are not 
persuasive. Regarding Applicant's arguments in pages 7-9 that the claimed invention is not 
obvious over Kostoff et al. (hereinafter "Kostoff), the Examiner respectfully disagrees. The 
Examiner admits Kostoff does not directly anticipate the claimed invention. The primary 
difference identified by the Examiner is that, while Kostoff teaches limiting the output of the 
dictionary to the user to a specific set of high frequency words and phrases, Kostoff does not 
teach discarding the unused portion of the dictionary. The Examiner assumes that Kostoff must 
have sufficient memory to store the entire dictionary. Kostoff does provide the user with a 
means for limiting the output to a specific number of words and phrases, but does not directly 
offer an ability to limit the dictionary size itself. Kostoff does however teach the elimination of 
trivial words so as to contain the word list within memory. Therefore, Kostoff is aware of 
memory limitations, but does not use any sort of truncation upon the dictionary itself because 
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there must be enough memory to store the full word dictionary. The Examiner believes that one 
of ordinary skill in the art at the time of the invention would have obviously used the list 
truncation technique taught by Kostoff to have truncated the dictionary itself in order to fit the 
dictionary within a specific memory constraint. The Examiner believes a natural and obvious 
place where the person of ordinary skill would have truncated the dictionary would have been at 
the word and phrase frequency threshold number. This would have been an obvious 
modification because the words and phrases below the threshold are not used by the user. 
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Conclusion 



Any inquiry concerning this communication or earlier communications from the 



examiner should be directed to Peter J Smith whose telephone number is 571-272-4101. The 
examiner can normally be reached on Mondays-Fridays 7:00am-3 :30pm. 

If attempts to reach the examiner by telephone are unsuccessful, the examiner's 
supervisor, Joseph H Feild can be reached on 571-272-4090. The fax phone number for the 
organization where this application or proceeding is assigned is 703-872-9306. 

Information regarding the status of an application may be obtained from the Patent 
Application Information Retrieval (PAIR) system. Status information for published applications 
may be obtained from either Private PAIR or Public PAIR. Status information for unpublished 
applications is available through Private PAIR only. For more information about the PAIR 
system, see http://pair-direct.uspto.gov. Should you have questions on access to the Private PAIR 
system, contact the Electronic Business Center (EBC) at 866-217-9197 (toll-free). 



PJS 

12/9/2004 




ISORY PATENT EXAMINE. 



